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ABSTRACT 

Province-wide school-leaving examinations had been an 
integral part of education in British Columbia (Canada) until 1973 

^ W6re abandoned for a P eri °d °f 10 years and reintroduced in 
iy»J. This study investigated the effects of the grade 12 
examinations in secondary science, and of science assessments 
conducted in grade 10. Researchers observed and videotaped classes of 
2k teachers from 2 different school districts. Teachers and 
administrators were also interviewed. Results indicate that final 
examination at the grade 12 level did have a strong effect on 
teaching at that level and these examinations as they are conducted 
are not mere indicators of performance. Specifically, several 
teachers of grades 8 and 10 reported that the grade 12 exams had 
little or no effect on their teaching, while grade 12 teachers 
consistently reported the opposite. Moreover, any exemplary science 
teaching observed by the researchers took place in the lower grades; 
by contrast, straight lecturing, non-involvement by students, and an 
emphasis on tests and testing characterized the grade 12 classroom 
where the prevalent question heard was, "will this be on the exam?" 
The study concludes that the grade 12 examination tends to freeze' 
innovative teaching practices, that this augurs badly for improved 
teaching in the high schools, and that improvement can only come from 
teachers who feel some freedom to experiment and take risks. (PR) 
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Background 

The issue of national testing has featured prominently in political and educational 
discussions in both the United States and Canada in recent years. In Canada, a consortium of 
provincial Ministries of Education has instituted a pilot project designed to create national 
tests of achievement of [to be completed]]. In the USA, the federal executive branch and 
private groups have advocated a national examination for elementary and secondary students, 
and these proposals have generated strong controversy. 

The United States Congress recently commissioned a report on testing to help legislators 
understand the implications of a introducing a national testing program. That report (United 
States General Accounting Office, 1993) suggests that the cost of a national test modeled on 
common multiple-choice tests would be about 51 60 million annually; one using performance- 
based testing would cost in the order of S330 annually. Start-up development costs would add 
another SI 00 million. Tne report claims that, based on the responses to a survey of 368 local 
and state school administrators and testing directors, the benefits would outweigh the costs. 
Such benefits include diagnosis and evaluation information, positive classroom outcomes such as 
currriulum alignment with standards, and positive products of the assessment process such as 
clear stand ards^better public understanding^and teacher edification (p. 52). 

The GAO committee explicitly confined its investigation to questions concerning the 
nature and extent of current standardized testing, costs, and the effect of new national tests on 
those factors. It did not examine instructional or pedagogical issues, and it did not include 
classroom teachers in its survey. Kad it done, it might have considered a recent study by Smith 
(1991) who identified a number of undesirable effects of standardized testing on elementary 
teachers. These were centred on the effects of publication of test scores, belief in the invalidity 
of the tests, the emotional impact of testing on children, reduced instructional time, narrowing 
of curriculum, and deskilling of teacher work. In an important Canadian study, Calder (1990) 
conducted a study for the Alberta Teachers' Association to assess the impact of "diploma" 
examinations designed for school graduation, and concluded: 
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Although there is no consensus, and teachers see many ways in which the examinations 
negatively affect the teaching-learning process, they generally favor' the retention of 
the diploma examinations. The examinations are seen more as fulfilling a political 
evaluative function than being of educational value. Although teachers report more 
negative impact of the examinations, they seem to be reluctant to dispense with them. 
(p.3) 

In Canada, each province determines its own curriculum and ensures compliance through 
a central Department or Ministry of Education. Half the provinces (British Columbia, Alberta, 
Saskatchewan, Quebec, and Newfoundland) require students to pass centrally-set school- 
leaving examinations as a condition for school graduation. Many provinces also have in place 
assessment programs to monitor general levels of achievement in various subject areas. Some of 
these involve standardized tests; others use locally developed curriculum-based instruments. 

In this paper, we have differentiated three forms of achievement testing common in 
Canada. "Syllabus-based tests" are those created to match a clearly defined curriculum and 
are generally used to assess individual performance for the purpose of school graduation. 
"Standardized achievement tests" consist of "...a fixed set of test items designed to measure a 
clearly defined achievement domain, specific directions for administering and scoring the test, 
and norms based on representative groups of individuals like those for whom the test was 
designed (Gronlund, 1981, p. 304)." Moreover, on such tests, the items will have been pretested 
and selected on the basis of difficulty, discriminating power, and relationships to a defined set 
of specifications, and will in all probability be cast in a multiple-choice format. "Program 
assessment instruments" are designed to determine general levels of achievement of a cohort of 
students. Such instruments will be constructed to match a broad curriculum that may extend 
beyond that covered in school, and may make use of samples of students. 

We see three underlying purposes for "large-scale testing," that is, those testing 
activities that are not under the control of the classroom teacher or the school. Those are 
selection, accountability, and control. Selection is designed to sort individuals for future 
education or vocation. Accountability is designed to identify agencies within the educational 
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system that may be held responsible for the failure or success of students, usually as groups 
rather than individuals. Control centres on the issue of who is to determine what students are 
to learn and how that will be accomplished. 

The Case of British Columbia 

If we accept that testing is an inevitable concomitant of instruction in North American 
schools, what would constitute the ideal situation? Without considering the quality of the 
tests themselves, each form of test would be used differently. Shepard (1989) cautions that 
"the differences between accountability and insL actional assessment are so fundamental and 
necessary that it may not be desirable to merge the two purposes" (pp. 7-8). Thus, standardized 
tests would be used to diagnose individual students' performance, as they were originally 
intended to do. Such tests would be used, for example, to help make decisions about assigning 
students to learning assistance or gifted programs. Syllabus-based tests would be used to 
determine the success of individual students in the school program. Program assessment 
instruments would be used to evaluate the effectiveness of the educational system as a whole 
and, perhaps, that of major sub-units. 

Shepard (1989) also postulates other desirabie characteristics of an effective 
evaluation program. She points out the need for teachers to have a say in decisions regarding 
individuals: "Although single teacher tests are probably less reliable (in a statistical sense) 
than a one-hour standardized test, the accumulation of data gathered about individual pupils 
in the course of a school year has much more accuracy" (p. 7). For large-scale assessment she 
recommends the use of matrix sampling, as in the California Assessment, and a "cycle of topics 
so different subjects could be comprehensively assessed each year" (p. 9). 

If one examines the structure of assessment in British Columbia, one might conclude that 
this province has in place all the recommended ingredients for effective assessment. Let us 
summarize each of the three testing modes. Although it has been difficult to find specific 
information on the use of standardized tests in the province, we believe that school districts, in 



general use such tests for diagnostic and within-schooi placement purposes. At worst, districts 
have a policy to administer standardized tests each year at certain grade levels, but the results 
are simply entered into the students 1 files, with no follow-up activity. The lack of information, 
generally, would indicate that the results of standardized tests are of little concern to teachers 
and administrators either for promotion or program evaluation. 

Syllabus-based tests are in place fo; virtually all subjects at the Grade 12 level, and 
only at that level. Ostensibly, then, the examinations serve the twin purposes of ensuring 
comparability across schools and reassuring the public that educational standards are being 
upheld. The Ministry of Education tests contain a mix of multiple-choice and open-ended 
questions, and the Ministry clearly, and publicly, indicates the examinable body of knowledge 
through its curriculum guides and tables of specifications for examinations. Moreover, teachers 
have direct input into the grades awarded because the final mark for the student is based 40 
percent on the Ministry examination and 60 percent on a teacher-awarded mark. At all other 
grade levels, grades and promotion decisions are based entirely on teacher-awarded marks. 

The Ministry's assessment program assesses subject areas on a cyclical basis. In Science, 
assessments occurred in 1978, 1982, 1986, and 1991. The 1991 science assessment was designed to 
address four aspects of student outcomes: affective, processes and skills, knowledge, and higher 
level thinking. Although the first three assessments were based on the population of students 
at the three grade levels using only multiple-choice items, the 1991 assessment was different. 
The Ministry now is moving toward alternative means of assessment, including observations and 
interviews (see, for example, Bateson, Erickson, Gaskell, & Wideen, 1991). No longer are 
results to be reported at the school and district levels, and the 1991 Science assessment adopted 
matrix sampling of items and students to allow greater curriculum coverage at the expense of 
local reporting. 

Structurally, then, the system in British Columbia complies with many 
recommendations of test critics. But, what is the underlying reality? As previously mentioned, 
for example, we know little of how standardized test results are used. Are they administered 
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simply because it is a policy to do so? If they become part of a student's permanent record, do 
they play any direct or indirect role in teachers* or counsellors 1 recommendations for promotion 
or program? And why do districts adopt different policies regarding standardized testing? 

With regard to final examinations, the conventional wisdom seems to be that teachers 
teach to the test, at the expense of other instructional goals. Why should this be the case when 
more than half the student's mark is based on teacher evaluations? In their own evaluations, 
do teachers use instruments similar to those developed by the Ministry, or do they use greater 
variety? Do teachers evaluaie students on objectives other than knowledge or skills, as 
Popham (1987) suggests they will using his measurement-driven instruction model? Do they 
value and teach the processes of developing an understanding of our world through science and 
attempt to evaluate such student understanding? Would they do things differently if external 
examinations did not exist? 

With regard to the assessment program in science, we have some information to suggest 
that teachers have little understanding of how the program operates and do not read the 
assessment reports. How seriously, then, do students and teachers view the annual assessment 
program? Are the results reliable and valid indicators of student achievement, particularly at 
affective and higher cognitive levels? What policy decisions have been made at the 
provincial, district, and school levels on the basis of assessment results? Have individual 
teachers been evaluated using these results? Do teachers even care about such matters? 

If we are to understand fully the relationship between large-scale testing and science 
education, it is necessary to go beyond the rhetoric of advocates and opponents of testing. We 
have found that much of the critical literature is based simply on the analysis of multiple- 
choice test items (e.g., Hoffman, 1962; Morgenstern k Renner, 1984). On the other hand, in a 
circular argument, advocates use test results themselves to justify test-driven programs (see, 
e.g., Popham et al., 1985). Where the influence of testing on teacher practices is described, 
that information is based on surveys and interviews (e.g., Darling-Hammond & Wise, 1985). 
All these sources are at least one step removed from what actually occurs in the classroom. The 
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study reported in this article contains a much closer look at teachers as they conduct their 
classes, and describes their outlook on large-scale testing, as well as the role their own 
evaluation practices play in their teaching. 

Purpose 

The study was designed to address the following questions: 

1. What has been the impact of large-scale testing on curriculum policies at the district and 

school levels and on science teaching practices at the classroom level? 

2. What factors in addition to large-scale testing influence teaching practices and the 

perceptions of the various people involved in the educational system with regard to science 
teaching? 

3. What has been the general impact of large-scale testing in relation to other factors? 

4. How can the impact of large-scale testing be explained in terms of social change theory, 

curriculum decision-making, and the philosophy of science teaching? 

Methodology 

Province-wide school-leaving examinations had been an integral part of education in 

British Columbia until 1973 when they were abandoned for a period of 10 years. In 1983, the 

government reinstated a system of centrally set and marked examinations 

to ensure that grade 12 students meet consistent provincial standards of achievement in 
the academic subjects. The examination program will also ensure that graduating 
students from all schools in the province will be treated equitably when applying for 
admission to universities and other post-secondary institutes. An additional purpose of 
this program is to respond to strong public concerns for improved standards of education, 
(British Columbia Ministry of Education, 1983, p. 6) 

Given that the province had experienced a ten-year period with no final province- 
wide examinations at the 12th grade level, followed by their reintroduction in 1983, and given 
that it also had developed a large-scale assessment program, a unique opportunity existed for 
an impact study of the effects of large-scale testing in British Columbia. Hammersley and 
Scarth (1990), for example, who found some evidence indicating that humanities teachers used 
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different instructional practices depending on whether the course was examinable or not, 
suggest that such an occasion might be. used to assess the general "washback" effect of 
examinations within a school system. 

We designed the study initially to examine the impact of large-scale testing on science 
teaching at the district, school, and classroom levels; to explore the more general effects of a 
clearly mandated policy initiative on change and decision-making at different levels of a 
social system; and to address a number of philosophical issues related to the role of large-scale 
testing in education. Although outside the immediate scope of our initial undertaking, we soon 
became aware, as a result of work done during the pilot phase, that other important factors, 
such as teaching styles, district policies, quality of teacher training, availability of inservice, 
crowded curricula, and teacher background and perceptions of science also affected the quality 
of science teaching. Moreover, it was clear that it would be very difficult to extricate the 
effects of large-scale testing from these influences. Hence, to make the study as inclusive as 
possible, we decided to extend it to include other factors and thereby paint a more complete 
picture of science teaching at the secondary level. Thus, while the study was specifically 
focused on the impact of large-scale testing, its scope was broadened to address other research 
questions and issues. (A more comprehensive discussion of the testing controversy and 
methodology can be found in Wideen, O'Shea, Pye, Sherwood, & Ivany, 1991) 

The central issue of the study concerned the effects of province-wide Grade 12 final 
examinations in secondary science (Biology, Chemistry, and Physics), and of science assessments 
conducted in Grade 10. To examine the effects of external testing across the secondary school, we 
collected data from teachers and students in Grades 8, 10, and 12 and from principals, district 
staff, and Ministry officials as well. In the first year we conducted intensive case studies of two 
school districts and in the second year we expanded the study to ten other districts. 

Two school districts participated in the first phase of the study, conducted during the 
school year of 1988-89. The districts were chosen to represent a large metropolitan school area 
and a smaller, semi-rural one, and both were amenable to having researchers enter their 
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schools. In this phase, members of the research team observed and audio-taped 4 or 5 science 
classes of each of 24 teachers (12 in each district; 3 in each of Grades 8, 10, and 12) in 6 schools in 
one district and 5 in the other. The schools were chosen by district officials, and the principal 
in each school identified teachers who agreed to allow the research team into their 
classrooms. We also interviewed various members of the educational community, including the 
teachers in the classrooms observed, principals and science coordinators in the school, district 
personnel, and students. Questions on the teacher interviews included the teacher's 
understanding of science (world view) and science teaching; background, including the number of 
years he or she had been teaching and his or her preparation in science; teaching style; policy, 
procedure and practice at the school, and district levels; and the influence of other external 
factors such as curriculum guides, parental expectations, textbooks, post-secondary institutions' 
expectations, and the influence of colleagues. 

Each researcher was assigned primary responsibility for a number of teachers. That 
person undertook the majority of observations for a given teacher, and was responsible for 
writing up a summative "teacher profile." The profile consisted of a two-part description of 
the teacher and his or her classes. Based on a printed transcription of the interview audiotape, 
the researcher constructed a portrait of the teacher under the following headings: background, 
world view, teaching, influences on teaching, and testing. The second section contained a 
general description of the classroom followed by a one-page summary of each class observed, 
including a brief interpretive comment. 

In some cases, researchers had spent considerable time in a school and had interacted 
with department heads, vice-principals, principals, librarians, and other staff members. 
Where possible, school profiles were constructed along lines similar to, but not as extensive as, 
the teacher profiles. 

In the second phase of the study, conducted in 1989-90, we modified the observation 
instruments, adopting Stodolsky's (1988) definitions of classroom practice. The interview was 
refined to capture in more detail the issues that appeared to inform and influence science 
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teaching. To determine the Phase Two sample, the 75 provincial districts were stratified into 
six geographical regions. Six districts were randomly selected from each region, and four more 
were drawn from the regions with the largest populations. We contacted the superintendent in 
each of the ten districts and received permission to approach the various schools in the district. 
Schools were chosen at random in the larger districts, while in the smaller districts we had to 
accommodate to the exigencies of school size and the availability of teachers willing to take 
part in the study. Individual members of the research team observed a single science class of 
each teacher and followed this observation with an inJepth interview with the teacher. 
Data were obtained from a total of 56 teachers in Phase Two. 

In this phase, a single researcher interviewed a teacher and observed one of his or her 
classes. Although the observation and interview were audio-taped, neither was transcribed, 
and the researcher produced a two-page teacher summary based on his or her analysis of the 
transactions on the audio-tapes. The observation coding sheets and the interview summaries 
formed the basis for data analysis in Phase Two. 

The wealth of qualitative data contained in the teacher interviews served as the major 
source to gain insight into the phenomenon of science teaching at an individual level. To paint 
a more comprehensive picture and search for explanations we developed a procedure to quantify 
information related to a number of variables prominent in the data. Twelve teacher variables 
based on the interview questions were chosen. These variables portrayed information about an 
individual teacher's background and experience as well as that teacher's world view and 
approaches to teaching: 

• number of years of science teaching experience, 

• level of science education, 

• inservice background, 

• world view, 

• importance of teaching science in schools 

• desired student outcomes, 
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• approach to teaching science, 

• desire for change in science education, 

• the influence of external factors on teaching, 

• importance of large-scale testing, 

• impact of large-scale testing on teaching practices, 

• importance of classroom testing, 

For each variable a five-point scale was developed and teachers were placed on the 
scale based on their responses to interview questions related to these variables. For a number of 
variables, typical responses were identified during the rating process for each point on the 
scale. For example, the variable "impact of large scale testing on teaching practices" consisted 
of five points ranging from "no impact" to "strong impact" with the following typical response 
associated with each point: 

1. "No impact." 

2. 'The Grade 12 exams have put an additional responsibility on teachers to cover the Grade 

8 curriculum/ 1 

3. "I give more multiple-choice and fewer rigourous essay questions, and also cover more of 

the curriculum." 

4. "Teaching was more relaxes .nd there was more time to 'play with' labs before large-scale 

testing was introduced. Large-scale testing discourages students who have an interest in 
science but don't want to face government exams/' 

5. "I used to do an integrated project with the English teacher which is no longer possible. I 

did more project work, more library work, more open-ended work, and more programs 
involving the community." 

Results 

This section provides a descriptive summary of the data. We begin with an overview 
of where the policy change appeared to have its greatest effect. We then provide the data 



ERLC 



12 



11 

collected relative to science teaching as we observed it in classrooms. We also report on those 
factors which influence teaching as reported by the teachers in the interviews. Finally, we 
report the data we collected that directly relates to the question: what has been the impact of 
large-scale testing on the instructional activity of teachers? 
Policy Implications of Large-Scale Testing 

It had been assumed that large-scale testing would potentially have effects at the 
district, school and classroom levels. During the course of our data gathering, and particularly 
during the first phase in which we worked exclusively in two districts, we sought information 
about how the introduction of large-scale testing affected policy at those three levels. We 
looked for evidence that districts or schools undertook some policy initiatives either verbally 
or in writing that would smooth the transition to final examinations. It soon became evident 
that we were attempting to document a non-event. Neither districts nor schools produced any 
evidence of written or spoken policy that had resulted from the introduction of final 
examinations. Final examinations it appeared had become a direct link between the Ministry 
of Education and teachers, with the district and schools acting as observers of the events rather 
than as participants. 

The main role that we saw schools and districts play in connection with final 
examinations at the Grade 12 level evolved over time and generally took the form of 
comparative reporting of school and district results. This practice varied from district to 
district and from school to school. In one school we were shown graphs where each teacher's 
results on the final examination were represented on a year-to-year basis and the vice- 
principal routinely shared these data with the teachers involved. In other cases both schools 
and districts assumed a more laissez-faire attitude toward the results from the examinations, 
leaving interpretations to the teachers themselves. 

The most immediate impact, however, occurred at the classroom level. Thus, much of 
this report deals with the impact of large-scale testing on the work of teachers. Other effects 
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on curriculum and practices, at the schools and districts where they did occur, appeared to be of 

a more subtle nature. 

Classroom Practices in Science Teaching 

Because the study primarily sought to examine the impact of large-scale testing on 
instruction, we saw it as important to begin this section with a description of our observations of 
the teaching we observed and how the teachers saw their own teaching. This section then 
begins with an aggregated report of the teaching we recorded on timed coding sheets as we 
observed teachers in their classrooms. The second section reports how the teachers described 
their own teaching; these data represent a quantifying of qualitative data. 

Classroom observations 

We first described the science teachers' instructional format in terms of the percentage 
of time they spent on each of the activities as shown in Table 1. There it can be seen that the 
most consistent and prevalent mode of instruction consisted of having the students engage in 
seatwork. Lecturing and instructing were also prominent, particularly in the upper grades. The 
amount of discussion and laboratory work appeared to decrease with increasing grade levels. 
However, relying solely on the mean percent of time devoted to each activity across the classes 
tended to be misleading because the number of classes varied in which the activity was 
actually observed. 

[Insert Table 1 about here] 

To show more clearly the activities we observed, we cast the information in the form of 
box-and-whisker plots, as indicated in Figure 1. To interpret the diagrams, one needs to know 
that the + indicates the location of the median class, the left side of the box indicates the 25th 
percentile, and the right side of the box the 75th percentile. The dashed line shows the 
expected range of the upper 25 percent of observations, based on the length of the box. The dot 



9 

ERIC 



14 



13 

on the dashed line shows the highest observed value within that range. Any values outside 
that range, that is, to the right of the dashed line are outliers, indicated by x. An extreme 
outlier is shown as X. Where no boxes are shown, fewer than 75% of the classes engaged in that 
activity, and in such cases the dots indicate the percent'of class time for each class. 

(Insert Figure 1 about here) 

To illustrate, consider the activity "giving instruction" at the Grade 8 level where we 
observed 17 classes. The median percentage of time teachers engaged in this activity was 13. 
The first quartile (25th percentile) was zero, and the third was 17 percent. The largest percent 
within the expected range was 35. In one unusual class, the teacher devoted 55% of the time to 
giving instructions. The raw data on which the diagram is based consisted of the following 
percentages: 0, 0, 0, 0, 0, 8, 11, 11, 13, 13, 14. 16, 17, 22, 25, 35, 55. 

In general, because no box shows a whisker attached to its left side, at least a quarter of 
teachers at each grade level did not use each activity. In most cases, because the median 
coincides with the y-axis, at least 50 percent of teacher did not use that activity. 

It is clear, as suggested by the mean values shown in Table 1, that "seatwork," used by 
over 50% of teachers at each grade level, was the most popular instructional format. In this 
format, the students worked individually at their desks on a common task. The median percent 
of time at each grade level was around 20% and the highest values around 70%. In a quarter of 
the classes we observed, teachers used seatwork for over 40% of the class period. 

The next most frequently used format was "teacher instructing" which involved 
relatively short exchanges between the teacher and students, usually where the teacher called 
on individual students to answer questions. The use of this mode increased from Grade 8, where 
more than half the teachers did not use this approach, to Grade 12 with a median of 18%. In 
one unusual Grade 8 class the teacher used this mode 70% of the time; in Grade 12 one teacher 
used it for the entire class. 



15 



14 

"Lecturing," where the teacher talked most of the time to impart information, ideas or 
skills, was used by over half the Grade 12 teachers, and for up to 80% of the time in two cases. 
One extreme Grade 10 teacher lectured for about 90% of the period. Few Grade 8 teachers 
lectured, and then it was up to about 20% of the time. 

A popular mode in instruction at the Grade 8 level, although used by less that 50% of 
teachers, consisted of "laboratory work." In three cases, virtually the entire class was given to 
this activity. Few Grade 10 and 12 classes were so engaged. "Giving instructions," where the 
teacher told the students the plan for an activity, was used by more than half the Grade 8 
teachers, and this may be as a result of the higher laboratory activity at this grade level. The 
incidence of the teacher giving instructions decreased with increasing grade level in seeming 
inverse proportion to the laboratory work. 

The use of "discussion," where the teacher led a discussion and tried to elicit opinions 
and ideas rather than right answers from students, was more prevalent at Grades 8 and 12 than 
at Grade 10. At all levels, however, fewer than half the teachers engaged in this mode. 

"Checking work," where the teacher provided short answers to homework, seatwork, 
or tests was more frequently used in Grade 10 than the other grades. Three teachers spent an 
inordinate amount of class time so engaged. 

The three activities of "demonstration," "test or quiz," and "audiovisual" were 
observed in fewer than a quarter of the classes, perhaps because teachers knew we were coming 
at a particular time and wished to engage in some kind of "teaching" activity. In no Grade 8, 
10, or 12 class did we observe an instance of "group work/ tutoring", outside of laboratory work, 
or of "contests/learning games." 

In general, as might be expected, the student activity reflected the instructional format 
of the teacher. When teachers were lecturing, instructing or demonstrating, students were 
watching and listening. Similarly, if the instructional format was lab work, seat work, or 
discussion students were for the most part involved in these activities. 
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The cognitive level of the activity in classrooms was quite consistent across grade 
levels. About 55%-58% of time was spent receiving and recalling information or facts (level 1) 
and 40%-4?% of time was spent on concepts and skills (level 2). As for higher order thinking 
skills such as synthesis and evaluation (level 3), researchers observed these not all in Grades 8 
and 10 and only about 2% of time at Grade 12. 

These results point to a pattern of teaching across the grades that primarily involves 
lecturing, instructing, and seatwork. Despite that general characterization, some notable 
differences occurred across the three grades. We saw a narrowing of the instructional pattern as 
we moved from Grade 8 to Grade 12. As observers, it appeared to us that the most vibrant 
classes took place at the Grade 8 and 10 levels. Grade 12 classes, on the other hand, were 
marked with a palpable desire to come to grips with the material presented. A sense of fun and 
enjoyment seemed lacking here. This was work and made to seem so. We sensed a strong need to 
process a great deal of material very quickly. The single-mindedness of the enterprise was 
underscored by the impatience demonstrated by these students when the teacher withheld 
answers or ventured into territory that would not appear on examinations. In contrast, classes in 
Grades 8 and 10 showed a more leisurely pace with more time to explore and attempt different 
approaches. Although the amount of innovation and use of alternate models of teaching did not 
appear startling, when we did observe such innovation, we typically observed it at the Grade 8 
and 10 levels. 

Teacher interviews 

The interviews with teachers were analyzed and coded with respect to 12 variables of 
interest. For example, the variable "teaching approach", was generated from the interview 
question that asked teachers to talk about what was typical of their science teaching. 
Teachers who described their teaching as all lecturing or teacher instructing were assigned a 1. 
The score was incremented by one for each additional method that the teacher described as 
part of his or her repertoire. For example, a teacher who described his or her teaching as 

17 

ERIC 



16 

including direct instruction, lab work, projects, cooperative learning and peer tutoring would 

have been assigned a 5. 

The responses of teachers and their subsequent ranking on the "teaching approach" 
scale described above are shown in Figure 2. These data suggest that there are marked 
differences in the way Grades 8, 10, and 12 teachers describe their teaching approaches. The 
majority of the Grade 12 teachers were ranked as 2s. For the most part they describe their 
teaching as lecture or teacher instruction plus labs. Most of the Grade 10 teachers were ranked 
as 3's indicating that they describe their teaching as including at least one other method 
besides the traditional lecture/lab approach. Grade 8 teachers were also mostly 3's but there 
were significantly more teachers at this level who were in the 4 and 5 categories, that is, they 
describe a wide variety of teaching methods being used in their classrooms. 

The variables, the criteria for assigning numerical ratings to the interview data, and 
illustrative examples drawn from the data base are as follows: 
Science education. Based on the level of formal science education. 
Level: None Some BSc MSc PhD 

Rating: 1 2 3 4 5 

Amount of inservice. Based on the number of inservice sessions a teacher attended in the previous three 
years and the professional organizations he or she belonged to. Each inservice and organization was 
given a weight of 1 and ratings were assigned up to a maximum of 5. 
Teaching experience. Based on the number of years teaching science. 
Years teaching: 0 1-5 6-10 11-15 16+ 

Ranking: 1 2 3 4 5 

Goals for students. Based on a continuum of "practical" through to "curious". Those who 
expressed a desire to impart purely practical knowledge were assigned a score of 1 and those who 
wished that their students leave with a curious nature were given a score of 5. 

1. To impart knowledge of the environment and the human body 

2. To create an awareness of the technology around them. 
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3. To create interest and understanding. 

4. To develop in students a desire to discover their universe 

5. To instil a sense of curiosity about the world. 

Teaching approach. Based on a ranking from teacher-centred to student-centred. Teachers were 
ranked on a continuum from "le-lurer" (1) through to "facilitator" (5). 

1. Today's class was typical: an hour of lecture with ten minutes of discussion. 

2. He teaches with lectures and demonstrations. 

3. He divides his class into three equal parts: lecture, demonstration and lab. 

4. She uses oral reading, discussion, demonstration, co-operative learning and labs. 

5. He does a lot of group work and returns a question with a question rather than just giving 

students information. He believes that students often have the knowledge and that the 

teacher's role is to draw it out and help them put it together. 
Desire for change* Based on our question: "If you had the power to change anything, what would 
you do to improve science teaching?" 

1. Wants his own video machine and more human anatomy charts. 

2. Would like more field trips. 

3. The curriculum should be more flexible. 

4. Science teaching should be integrated with the other subjects. 

5. Get rid of text books, promote hands on investigation, reduce classroom size and integrate the 
sciences. 

The importance of science in the classroom. Based on teachers' responses to our question "what would be 
lost if science were taken out of the curriculum?". 

1. There would be very little lost. 

2. Something else would take its place. 

3. We could not make rational decisions about how the world ran. 

4. Science is necessary for the advancement of society. 

5. Devastating to our civilization. 
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World view. Based on teachers 1 understanding of the nature of scientific endeavour. Teachers 
who felt that science is a "search for universal truths" were ranked 1; if they also believed 
science to be a "set of processes" they were ranked 2. Teachers who felt that science is the 
"construction of models" were ranked 5; if they also believed science to be a "body of content" were 
ranked 4. Those teachers who felt that science was both a "search for universal truths" and the 
"construction of models" or that science is both a "set of processes" and a "body of content" were 
ranked 3. 

Importance of classroom testing. Based on teachers' views of classroom tests as a means of motivation, 
review, control, diagnosis, and external accountability. If they scored 3 or more with "very important", 
we gave them a rank of 5. 

1. No use seen as very important. 

2. Only one use very important 

3. Two uses very important. 

4. Three uses very important. 

5. Four or five uses very important. 

The importance of large-scale testing. Based on teachers' views of large-scale testing for the same 
purposes as the previous variable. 

1. No factor rated important or very important. 

2. One or two important factors; none very important. 

3. Three or more important factors; none very important. 

4. One or two very important factors. 

5. Three or more very important factors. 

Influence of external factors. Based on teachers' rankings of six external factors: external examinations, 
curriculum guides, textbooks, post-secondary institutions' expectations, colleagues, parental 
expectations. We looked at each teacher's first and second most influential factors. Weights were 
assigned to each factor as follows: external examinations, 4; curriculum guides, 3; text books, 2; post 
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secondary expectations, 2; colleagues, 1; parents, 1. Ranks were assigned according to the total ol 
two choices as follows: 1. three or less; 2- four; 3. five; 4. six; 5. seven. 

The total picture which included all 12 variables is shown in Figure 2. Based on these 
results, we suggest that a hypothetical teacher who represents the sample would look • 
something as follows. The individual is male, with r bachelors degree in science (although if 
teaching Grade 8 he might have a lesser qualification), he has about ten years of experience 
teaching science, probably double that if he is teaching Grade 12. Tne total number of inservice 
sessions he has attended in the previous three years or science teaching organizations he 
belongs to is around two or three. 



[Insert Figure 2 about here] 



He has no well-defined view regarding the nature of the scientific endeavour. While 
he sees science as a search for universal truths, and those truths make up its content, he also 
understands sdence to be a set of processes undertaken to construct models to explain natural 
phenomena. Consonant with this outlook, he wants to instil in his students both a practical 
knowledge of science and a drive to satisfy their curiosity in scientific matters, but if teaching 
Grade 12, he feels the more important aspect is the usefulness of the knowledge. He sees science 
as an important subject in the school curriculum, but not absolutely critical for general education; 
other subjects might be able to take up the slack and generate the skills now taught only in 
sdence. 

In his approach to teaching science, he claims to strike a balance between being 
teacher-centred and student-centred. He might say, for example, that he divides his class into 
three equal parts: demonstration, lecture, and lab. Or that he likes cooperative learning and 
often uses groups in lab work. If he teaches Grade 12, however, he is more likely to favour 
lectures and demonstrations, perhaps using lectures rather than discussions to cover the 
material. He is reasonably satisfied with the status quo. He has adequate resources for 
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teaching, although perhaps would like to have a larger classroom or fewer students or a lab 
technician to help out in preparing labs and maintaining inventory, if teaching Grade 8, he 
might like to see broader changes in curriculum such as integrating science with other subjects 
and increasing the amount of hands-on investigation. 

It is only when we consider the issue of testing and external factors that influence 
teaching and curriculum, that we must make a clear differentiation between Grade 12 teachers 
and those at the lower grade levels. Grade 12 teachers are much more influenced by external 
factors of examinations and curriculum guides, and they see large-scale testing as much more 
important and as having a much greater impact on their teaching. Finally, they view 
classroom tests as more important than do other teachers. The typical Grade 12 teacher is 
influenced almost entirely by the curriculum and examinations. Generally speaking, post- 
secondary expectations, textbooks, colleagues, and parents have little impact. He considers 
large-scale testing to be very important as a means to motivate students, review material, 
ensure student attentiveness, diagnose teaching, and ensure external accountability. The impact 
of large-scale testing has been very strong on his teaching. He may now teach fewer labs and 
lecture more than before external exams were instituted. He may have previously had students 
undertake projects in collaboration with other subject-matter teachers. He may feel that he 
must now concentrate on objectives and reduce the number of side issues students are able to 
explore or the amount of lab time in which they engage. 

Grade 12 teachers were ranked high on all four variables related to testing. Grade 10 
teachers as a whole ranked much lower, but still higher than the Grade 8 teachers. The data 
are consistent in showing the greatest effect at the Grade 12 level and the least at Grade 8. Our 
sense is that the increase in effect occurs in direct proportion to the imminence of the Grade 12 
examination. A confounding factor, of course, is the presence of the Ministry of Education's 
assessment program at the Grade 10 level. However, assessment in science occurs only once 
every four or five years, and many teachers could not recall details of the previous occurrence. 
They seemed to view it more as a nuisance, taking up class time, and did not think that the 
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students took it very seriously. In looking at the results across the grade levels, we found that 
75% of the Grade 8 teachers and 76% of the Grade 10 teachers viewed the British Columbia 
Learning Assessment Program as "not important", while 53% of the Grade 12 teachers felt this 
way. From teachers' comments it is clear that program assessment is viewed as "low-stakes" 
rather than the "high stakes" attached to the Grade 12 leaving examinations. And, these 
findings may also suggest that Grade 12 teachers, although these examo do not directly affect 
them, are inclined to view testing in general as being more important than do the other two 
groups. 

The Impact of Large-Scale Examinations on Teaching 

In this section we draw on the interviews we had with teachers, and provide samples 
of teacher and student comments in secondary classrooms. In analyzing the remarks teachers 
made about how large-scale testing had influenced their teaching, we found that at the Grades 
8 and 10 levels, teachers generally reported little influence of the government exam. However, 
when we examined the remarks from Grade 12 teachers, we found that the exams had had an 
enormous impact. Only three teachers in the first phase of our study and two in the second 
phase reported that they saw merit in the government exam; they made the following 
statements: 

I think final exams all the way through are a reasonably good idea. People complain 
that you have to teach to the exam, and you do have to, but I think it's important that 
there is a reasonable level of consistency throughout the province in terms of what 
people are taught. (Grade 8 teacher) 

Programs that I'm familiar with prior to that increase in accountability (i.e. before the 
re-introduction of Grade 12 exams),...had a lot of desirable educational outcomes, but as 
to whether the programs were achieving it or not was hard to distinguish. So as to 
accountability, I think that exams are one way of assessing that. Probably the most 
cost effective way in terms of evaluating the students as well. (Grade 10 teacher) 
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As can be seen, these teachers value the consistency, accountability and cost 
effectiveness of the final examinations. While the'third teacher who saw some merit in the 
exams was quite clear in his reasons for doing so, as can be seen from the following quotation, he 
also felt some misgivings about them, and felt that they had changed his teaching, 

I do know that standardized exams, as much as I hate to admit it, add a certain amount 
of rigor and a certain amount of determination...(they) provide focus for the students, 
especially in the later part of the year. However, I don't explore as much as I do in 
chemistry ll...because 1 want to direct their attention and these are nice students and I 
want them to do well in the government exams for their own benefit. ..a lot of them are 
really competitive and they do want to do well and you sometimes feel guilty for 
taking them away from the meat that they need for the course. (Grade 12 teacher) 

The two teachers in the second phase who found favour with the government exams 
both taught Grade 12. One thought that the exams had improved the focus of Chemistry 12 by 
narrowing it to specific topics and levels, and the other felt that both teachers and students 
were now more accountable. 

The response from the majority of teachers, however, ranged from mild ambivalence to 
strong dislike in their views of the government exams when they were specifically asked, 
"what impact have Grade 12 exams had on science teaching?" The teachers at the various 
grade levels who expressed either a mild or a strong dislike for the Grade 12 exams had wide- 
ranging reasons for their feelings. However, their objections to the exams for the most part 
centred on four concerns: that the curriculum had narrowed the scope of what was taught; that 
the various psychological pressures caused by the exams were having an negative effect; that 
their abilities to teach creatively were being seriously eroded; and finally, that some 
fundamental aspect of education was being neglected in science teaching. 

Concerns about the narrowing of the curriculum touched on several areas. Many of the 
teachers felt that they no longer taught some topics in as much depth as they once had, others 
felt that some of the more difficult, but important areas, had been dropped in order to 
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accommodate the exam, and there was 'a generally agreed perception that much of what was 
taking place in the classroom amounted to teaching how to write tests rather than teaching 
science. Among the teachers who held strongly negative views about the exam, there appeared 
to be a general perception that the curriculum had become too restrictive and had caused 
teachers to concentrate on core curriculum requirements to the exclusion of anything else. Some 
of these views can be seen in the following. 

We have really narrowed our scope, I think. Some people are going to say that we are 
teach ; ng better because we have to focus on that final exam. I would look at it the 
other way, that we're probably not teaching as well because we're being forced to do 
this, this, this, and this and we can never broaden that..,I would like the latitude to go 
further in some areas and that has been taken away. (Grade 12 teacher) 

I think they've had a great deal of impact in that they've permeated all the science 
courses with a sort of focus to a certain body of information. (Grade 10 teacher) 

I think in the senior levels, the exam is the thing that dictates how you teach and 
what you teach and how long you spend on it. (Grade 10 teacher) 

(The curriculum) has been narrowed down, dunked down and watered down. ...They 
make everything pretty simple now-the physics course is not like it was 10 years ago. 
(Grade 8 teacher) 

Another teacher who refused to teach Grade 12 because of the exams had this to say: 

If I'm back in Grade 11 and 12, I'm really concerned about the examination aspect of it. 
You cannot branch out, you can't take a topic and expand and go. (Grade 8 teacher) 

Along with curricular concerns, certain of the teachers expressed a general unhappiness 
with what they saw as psychological pressures associated with the final exams. Some 
teachers felt that the exam produced fear and that this was a poor motivator, others felt that 
pressures caused by the exams were unfair to both teachers and students, and still others 
resented the comparative aspects of the exam in which teacher and district seemed to be pitted 
against one another. This sense of being compared was resented by most teachers who objected 
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to having their teaching assessed largely on the basis of student performance which they felt 
was not always an accurate reflection of how well teachers had taught a particular course. The 
following comments illustrate these concerns. 

No I don't like being compared. ..because we are not the same. The school is not the 
same, the kids are not the same, the community is not the same.. ..(Grade 12 teacher) 

So you have got that pressure there and it changes how you function as a human being if 
you've got this thing where everybody is going to be looking at you, how your results 
are. (Grade 10 & 12 teacher) 

...in Grade 12 you have to pass those governmental exams and that looms over the 
students as a negative pressure. (Grade 10 teacher) 

Pedagogical considerations also comprised a large proportion of teacher concerns. 
Teachers felt that changes had taken place all the way down the line; that the Grade 12 exam 
was having an impact at all grade levels. Some teachers felt that government exams had 
reduced the opportunities for spontaneity and depth, others complained that because of time 
constraints imposed by the exams, they were not able to conduct as many labs as they would 
like. Many of these teachers felt that "the luxury of getting side-tracked" had been taken 
away from them and that a subtle change had taken place toward a more content oriented 
delivery which discouraged both student and teacher creativity. Emerging from these 
comments is a general feeling that too much class time was being spent in preparing students to 
write the Grade 12 exams and that students themselves "tune out" if they know that wha* is 
being discussed will not be tested. One teacher complained that the exams "reduced the number 
of unusual and interesting things I am able to do", and several said that they would lecture less 
if there were no exams. Two of their comments follow: 

The government exams are a B.C. competition game you are playing and you can't afford 
to be 'down there* just because you went wool-gathering for a month-relevant as that 
might have been. (Grade 8 and 12 teacher) 
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(Before the reintroduction of government exams) I used to do more project work, more 
library work, more open-ended work and programs involving the community. (Grade 10 
teacher) 

Another teacher appeared to sum up a general feeling by saying: "...we now learn how to teach 
towards the exam. In other words, it is teaching us, the teachers." 

The above attitudes speak for themselves and must be seen to represent some serious 
problems in science teaching. But, perhaps the most unsettling concern reported by these 
teachers involves what they see as an erosion in the kind of science that is being taught. Many 
of them felt that science classes had become "content oriented" and that they had been reduced 
to little more than the presentation of a string of facts to be memorized for the final exam. Even 
teachers who were ambivalent about the exam agreed that since its reintroduction teachers 
had been forced to "concentrate on objectives, to concentrate on facts." One Grade 12 teacher 
stated that the government exam had eroded the ideals of good science teaching which 
included developing a sense of curiosity and a sense of social responsibility. Another felt that 
in limiting the teacher's ability to cover areas they are keen on, the students miss the 
enthusiasm and energy that the teacher brings to these topics and which often "rub off on the 
students. One teacher summed up his views this way: 

I think they (the government) are trying to upgrade teaching by providing a really 
severe exam. And, as a result, you have some teachers who maybe aren't that good as 
teachers but spend the entire year with a bank of exams, going over and over them and 
the kids maybe haven't learned anything. But, for a certain body of questions they 
(teachers) are great. But in terms of high level learning and their attitude toward 
learning and all the other things that education is about, they really aren't good 
teachers at all. (Grade 10 teacher) 

Another interesting aspect to the question of these exams involved the quality of the 
exams themselves. Teachers appeared to be divided on the subject of whether the largely 
multiple-choice exam proved to be a fair indication of students' knowledge and whether the 
exam itself contributed to what many see as a largely content driven curriculum. The teachers 
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who felt that multiple-choice questions were appropriate, did so generally on the basis of 
multiple-choice exams being easy to mark. Many of these teachers also felt that it was 
appropriate to teach students how to write exams because they would be faced with them when 
they reached university. Those who opposed this type of question felt that the exam often 
"manipulated students into making process errors"; and that they failed to test the higher 
cognitive levels. Some of the teachers in this camp also felt that exams motivated students to 
write exams rather than motivating learning and that they discouraged students who had an 
interest in science but who did not want to face final exams. As evidence of this last 
consideration, on several occasions the observers were told about schools or districts that 
discouraged marginal students from taking science classes in order that the district average 
might be kept high. 

While by far the majority of what we saw taking place in B.C. classrooms, and what 
was reported to us by the Grade 12 teachers themselves, suggest that final exams are indeed 
having an enormous impact on science teaching, there were contrary examples. Two of the 16 
Grade 12 teachers we observed in the second phase represented exceptions or anomalies to the 
general perception that the Grade 12 final exams were driving what takes place in the 
classroom. These two teachers felt so strongly about what they view as good science teaching, 
that they took it as a point of pride not to let the final exam influence them. In other words, 
they refused to teach to the test. One of these teachers acknowledged that students in his class, 
like most Grade 12 students, were unwilling to spend any amount of class time discussing or 
considering issues that would not be tested. To get over this hurdle, he told the researcher that 
he makes it a point never to answer the inevitable question, "will this be on the test?" Instead 
he takes a professional pride in including material that he deems worthwhile and in allowing 
students time for peripheral discussion around any of the topics raised. He encourages students 
to ask "dumb" questions. One of his teaching strategies involves opening an astronomy class by 
telling the students they have 10 minutes to ask him anything they have ever wondered about 
astronomy. "Well," he said, "I knew what was going to happen, one hour later we were still 
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discussing it." This teacher places considerable importance on teaching a unit on radioactivity, 
and although it does not appear on the government exam he continues to teach it nonetheless. 
He told the researcher: 

I have a super lesson on radioactivity which includes some wonderful demos and lab 
work and great library projects. Because a lot of us don't like to teach it, they don't 
want it on the government exam. 'I don't care about your exam, I'm going to teach it 
anyway; my kids will do whatever they do.' 

Like the teacher just mentioned, the second teacher also holds strong views on what 

should be taught in science. He, too, feels a certain pride in doing what he thinks is important. 

For him this means being willing to forego the certainty of high marks in favour of the luxury of 

discovery. He continues to teach thermodynamics although it is no longer required by the 

curriculum. An excerpt from the researcher's notes after listening to the tape of the interview 

with this teacher illuminates some of his thinking: 

X claims he had better ability to get students to enjoy science before the advent of 
exams-he says he hates to see kids stopped from exploring. He feels that the 
curriculum is the #1 influence on teaching and that it is highly tied to government 
exams, which he picked as #2 (influence). He says that if he wants to cover certain 
material he doesn't tell the students that it will be a non-testable item because he 
knows that if he says, "you will not be tested on this", the kids tune out. He thinks 
government exams have had an enormous influence on science teaching. He feels they 
are politically motivated, have taken away some tools that teachers used to use, and 
give students enormous pressure which he sees as not being conducive to learning. He 
says he still tries to do what he think important and tries not to be swayed by the 
exams. 

Interestingly both these teachers claim to produce students who achieve excellent 
results on the government exam. And while they don't feel that the exams have affected their 
teaching, as can be seen from their comments, they do feel that they have had an impact on 
science teaching in general and on student attitudes. 
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Discussion 

In attempting to answer the question, "what has been the impact of large-scale testing 
on the instructional practice of teachers?", we also identified factors other than large-scale 
testing that had had some influence on teaching. The data showed that final examinations at 
the Grade 12 level did have a strong effect on teaching at that level. This conclusion comes 
both from what teachers and students told us and what we observed in classrooms. What 
meaning then does one ascribe to this impact? In this section of the report, we attempt to 
address this question. 

From our analysis of this question it became evident that the impact of the final 
examinations currently used in Grade 12 in this province goes well beyond what may have been 
originally intended by those in the Ministry who initiated them. Whether these effects of the 
policy of reinstitu ting. final exams represent improvement in the schools or a step backward 
depends on the perspective one takes about value in education. One thing becomes quite clear 
from this analysis: the final Grade 12 examinations as they are conducted in the Province of 
British Columbia are not mere indicators of performance. 

We begin the discussion by pointing to what we see as an overall readjustment in the 
way science teaching, particularly at the Grade 12 level, has been viewed in the province and 
the effect final examinations have had in the process, using what we call "the pincer 
movement" as a metaphor to describe what takes place. We then focus more specifically on 
teaching itself and return to our brief review or science teaching to point to some of the 
limitations that large-scale testing has placed upon implementing the more innovative 
practices in science teaching. The third area of discussion moves beyond the immediate 
classroom to the larger issues of reform in education and places large-scale testing in that arena. 
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Limits to Cuiriculum Decision-Making: The Pincer Movement 

Different models exist concerning who ought to make curriculum decisions, who 
determines teaching approaches, and essentially who becomes the final decision-maker in the 
curriculum process. On the one hand, the objectives of the curriculum, the ways and means of 
achieving those objectives, and their evaluation could be left entirely to the teacher within 
the general guidelines set out by the Ministry. Here the teacher can exercise discretion with 
regards to the objectives of instruction, the ways and means of achieving those objectives, and 
the evaluation procedures to assess them. This scenario does exist in many jurisdictions in the 
Western World. In Dutch and Scandinavian countries, for example, teachers do have such 
freedom (Van Der Gegt & Knip, 1990). Teachers in many primary schools in our own province 
perceive themselves to have that freedom, and in fact do exercise it. On the other hand, one 
could set the learning outcomes a, id give teachers the responsibility to teach in whatever 
manner they wish to achieve those objectives. 

This latter scenario has driven curriculum makers in the province of British Columbia 
at the secondary level for the past several years. Learning outcomes have been set by 
curriculum committees, but the ways of achieving those learning outcomes and their evaluation 
have been left largely to schools and teachers. The junior secondary programs, for example, 
provide a curriculum and a textbook which, because of its length, requires the teacher to make 
choices of topics and to treat them however he or she wishes. The professional autonomy of 
teachers then remains intact: they have objectives, they have materials from which to choose 
learning experiences, but how they go about that and how they evaluate their students is left to 
their own professional judgement. In practice this has led to some diversity, but many teachers 
appear to follow the textbooks provided quite closely. This scenario, however, has allowed 
some teachers to try alternate approaches and to experiment with innovative practices as we 
have seen in this study, in Grade 8 and 10 classrooms in particular. 
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The policy of instituting final examinations at the Grade 12 level in the Province of 
British Columbia in 1983 has had an additional effect upon this model of curriculum planning. 
The fixed model of evaluation imposed at Grade 12 adds another level of constraint on this 
curriculum model This move becomes tantamount to creating a pincer movement on curriculum 
implementation. The fixed learning outcomes provide one part of the pincer and the final 
examination the other. Teachers now have very little room to manoeuvre either the learning 
outcomes of the curriculum or the evaluation side. Thus, their movements are restrained on 
either side leaving little or no freedom for the teacher with regards to the ways and means of 
instruction. 

This added restriction then raises questions about curriculum decision-making and 
power in education. If one accepts the arguments that have just been made, which our data 
strongly support, then it becomes clear that teachers in Grade 12 have lost much of their 
discretion in terms of curriculum decision-making. Final examinations "tave shifted the power 
of the curriculum to that group who prepare the final examinations. Teachers are no longer 
implementors of curriculum, but only those who deliver it. 

The further question is whether this development represents a positive move in science 
teaching or a negative one. On the positive side, it has identified the content on which 
students leaving high schools should be examined. That was one of the original reasons given 
for implementing the policy in 1983. On the negative side, the policy has undermined the 
notion of teacher as autonomous professional. We also find large-scale testing raising 
important questions about both teaching and general reform in education, issues to which we 
turn in the sections that follow. 

Science Teaching 

We assume that "good" science teaching should reflect different approaches to 
teaching, authenticity, intrinsic engagement, and a constructivist approach; in general it 
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should aim at developing an educated person. We comment briefly on each of these and discuss 
how large scale-testing may or may not have furthered those notions. 

The curriculum reform movement and developments since have provided educators with 
an array of teaching models from which learning experiences can be developed. Taken ■ 
together, they represent more that just a bigger cafeteria of choices. As Ausubel (1963) pointed 
out some years back, the need exists for different types of learning processes because each 
promotes different educational objectives, all of w uch become necessary to the process of 
education. Problems, he argued, can arise with any approach to instruction, not because it is 
inherently bad, but because it is used for the wrong purposes. Joyce,Weil, and Showers (1992) 
have since developed that argument much further through their work around models of 
teaching. They argue against dogmatism in style, positing that competence in teaching comes 
through extending one's ability to use different models of teaching. They base this argument on 
the notion that no single model can achieve the range of objectives thought to be necessary or 
expected. Like others, they argue, for example, that student understanding of the heuristics of 
a discipline and the development of attitudes are not necessarily achieved through a didactic 
approach. 

Large-scale testing does not encourage teachers to use different approaches to teaching. 
It appears to have narrowed the range of instructional practices being used in Grade 12 
compared to Grades 8 and 10, and indeed it may also have narrowed the purposes of education 
itself. Three things appear to be happening as a result of large-scale testing which work 
against the use of different approaches to teaching. The first involves a time factor. Teachers 
regularly reported not having time to use any strategy other than the direct teaching we 
reported earlier. Field trips, for example, simply took too much time away from class; teachers 
reported having reduced such trips since final examinations had been brought back. Given this 
restraint, it becomes difficult to imagine a teacher wanting to take the time to experiment with 
co-operative learning or inquiry teaching. Second, the students do not wish to undertake any 
activity that would distract them from the task of preparing for the examinations. Earlier we 
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spoke of the "no nonsense" atmosphere in most Grade 12 classrooms we saw.- Although we may 
see this attitude as a positive factor because it produces on-task behaviour, we can also see that 
it produces a focus on the content of the curriculum to be examined at the end of the year. Other 
outcomes, which may also be an important part of the education of children, take on relatively 
less importance when teachers are encouraged to focus on the content of the curriculum- A third 
factor involved the examinations themselves; success on them depends primarily upon 
committing to memory the content of the curriculum and algorithms required to solve set 
problems likely to appear on the exam. This technique underscores the mechanics of test 
writing rather than promoting any clear understanding of science concepts. 

How has the reintroduction of final examinations at the Grade 12 level affected the 
authentic nature of science teaching? We saw little inquiry teaching in the Grade 12 classrooms 
we observed, and only a few instances in Grades 8 and 10. One can hardly attribute this 
situation to the existence of final Grade 12 examinations alone. Many other factors augur 
against the use of inquiry teaching in classrooms. We identified several of these during our 
observations and interviews. The curriculum itself, with its heavy emphasis on content, did not 
appear to encourage the use of this particular strategy. We also gained a sense from our 
interviews that as a strategy, inquiry teaching had not been part of the vernacular of teaching 
over the last few years. At the district level, we found that in the few districts where any 
consideration was being given to science teaching, the emphasis was on building a pool of items 
with which to improve assessment. We could speculate that the reintroduction of final 
examinations has changed the professional agenda of the teaching force regarding how they 
maintain and improve their teaching. In the 1950s and 1960s that agenda focused upon the 
curriculum and the teaching of that curriculum. Today we see the use of alternate teaching 
strategies such as inquiry which had been part of that agenda set aside as testing and 
evaluation have taken over. 

The question as to whether an emphasis on examinations has improved students' 
understanding in terms of knowing the structure of the discipline in the sense that Bruner (1961) 
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described it, remains open to debate. Our analysis produces mixed views. On the one hand, 
Grade 12 testing appears to have focused the attention of both teachers and students on a 
specified area of content, and the degree to which the examinations reward that understanding 
is the degree to which students are assumed to understand the subject. This argument is 
countered by the observation that examinations also particularize knowledge to an extent that 
the isolated parts are rarely brought together. The time to sit back, contemplate, and come to 
understand what a subject area means does not appear to be occurring in the present Grade 12 
dp .rooms. In the words of one Grade 12 student, 

To understand physics you need to play with it, and have fun with it, In school 
w r e never do that. 

One of the notions that received attention during the period of curriculum reform, and 
one that has been emphasized since, involves student engagement. The concept takes various 
forms. First, there was the notion of on-task behaviour that emerged from the research on 
teaching movement; those students who remain on-task produce better achievement. More 
recently, the concept of active learning has come to the forefront. At one time in science that 
concept meant 'hands-on' activity; if students were handling materials that w T as better than if 
they were listening. This rather simplistic notion has given rise to a more thoughtful view of 
of active learning that does include hands-on activity, but in addition encourages engagement of 
students in classroom discourse and even in the selection of learning activities in some cases. 

The question as to how the introduction of final Grade 12 examinations affected student 
engagement becomes a difficult one to answer. In some Grade 12 classrooms, because of the press 
of content to be learned, students were found to be on-task in ways that were not evident in 
Grades 8 and 10. However, dnce Grade 12 students are a select group, that outcome may have 
been expected. What we did not see in most of the classrooms to any great extent in the 
lecture/discussion part of the lessons were discussions and arguments among students about the 
concepts they were learning. Verification remained something for the teacher to do; students 
became the audience, not of each other, but of the teacher. Virtually no student-student 



ERLC 



35 



interaction occurred in the classrooms we observed. The laboratories provided a different 
opportunity and setting; but here too, the dialogue centred around verification of well 
programmed results. Where we did find exceptions they generally occurred in Grades 8 and 10. 

Many contend that constructivism as it applies to science in the 1990s bears many of the 
characteristics of inquiry learning of the 1960s. We both agree and disagree. Certainly, many 
of the instructional approaches carry strong similarities; the general intentions appear 
somewhat similar. But we find the constructivist to offer a much more grounded position and in 
some ways a more radical one, depending on how far one wishes to carry the concept. While 
inquiry learning of the 1950s typically provided a more meaningful way of developing the 
conceptual side of the discipline, constructivism frequently challenges its foundation and how it 
is taught. 

MacKinnon (1989) recently summarized three principles that follow for science 
teaching from constructivism. First, he contends that teachers must first develop strategies 
that will allow them to become aware of the ideas held by their students about science concepts 
as these are derived from their ordinary language. Second, these ideas must be taken into 
account during instruction so as to construct new concepts. Third, students should be actively 
involved in the learning situation in ways that we describe in the previous section. Under this 
view, learning shifts from the 'empty vessel 1 notion to conceptual change which occurs only 
when students experience dissatisfaction with existing concepts through anomalies and puzzles, 
develop new conceptions, and see the plausibility of these. Constructivism points to the 
difficulty of learning new scientific concepts because one's ordinary explanations of phenomena 
usually suffice. 

We saw little evidence that teachers had any awareness of the notions of 
constructivism. Moreover, while some remnants of the three principles just outlined did appear 
in Grades 8 and 10, the teaching in most Grade 12 classrooms appeared based on a very different 
model of teaching and learning from that suggested by MacKinnon. It appeared that the 
authority for knowing and accepting information came primarily from the teacher and the 
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textbook. Conceptual change did not enter into the picture; in fact, pausing to become aware of 
the students' conceptions of phenomena became problematic because it took too much time. In 
conversations with a teacher who attempted to work from a constructivist approach in Grade 
12, he cited the students' negative reaction to it as a major hurdle. Basically, students wanted 
the factual information that would be useful on the examination; they did not wish to be 
bothered with understanding. 

Change in the Schools 

How change occurs within a social system has occupied a number of social scientists over 
the years. The work of two groups provides a background for the discussion that follows. 
Baldridge (1972) has contrasted two perspectives: human relations and political systems. The 
human relations perspective is social-psychological in nature, focusing on the individual and 
peer group relations. Chinn and Benne's description of "normative reeducative" approach to 
change illustrates this perspective (Bennis, Benne, Chinn, & Corey, 1976). It assumes persons to 
be inherently active, in pursuit of impulse and needs satisfaction; change here comes through 
altering norms and acquiring changed understandings. The human relations approach to change 
reflects what has commonly been referred to as the "bottom up" approach to change in a social 
system. Teachers seeking to make change within their own school would fall under this general 
approach. Baldridge contrasts this with the political systems approach to change which 
emphasizes the dynamic features of the organization, assumes that conflict is natural, that 
power elites govern most decisions, and that external groups have a great deal of influence. The 
introduction of large-scale testing examined in this study represented a political perspectives 
approach to change, one which involved the exercise of power as a means to effect change. It 
came despite little evidence that it was needed and against the wishes of the teachers' 
organization in the province. The question of interest in this study was how the districts, 
schools, and teachers changed to accommodate the introduction of that policy. 
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This study provided an opportunity to examine both the change process itself as well as 
some of the actual changes the were brought about by the policy decision to reintroduce Grade 
12 final examinations. 

The introduction of large-scale testing came as a mandated change in the province. To 
use the terms of Bennis, Benne, Chin & Corey (1976), it came as a "power coercive" move on the 
part of the Ministry where teachers had little choice but to comply with the initiative unless 
they decided to change the grade they taught. Once in place, however, certain other effects 
began to be experienced. The preparation and marking of the examinations began to be seen 
with some favour among teachers who described that experience as wonderful inservice for 
them, and also it gave them the inside track for preparing their students for the coming year. 
The camaraderie and networking that developed around the technology of processing the 
examinations had a positive effect upon the teachers involved quite apart from any effects 
upon their instructional practices. The policy decision also had an effect upon how teachers 
taught science, and how they began to view professional development. 

But as we indicated earlier in this section, the final Grade 12 examinations have 
become more than mere indicators of student performance; their effect can be felt beyond the 
classroom, raising issues of a broader educational nature. The issue of social change can be seen 
also if we take a more global perspective. For this examination, we need to step back and look 
at the high school in a more general and global way. 

A research team from Simon Fraser University recently reported the results of an 
extensive review of literature and a follow-up study of 16 high schools in a large school district 
in the Greater Vancouver area (Wideen, Pye, Naylor & Crofton, 1990). The literature review 
pointed to the high schools as an area in considerable difficulty with high dropout rates and 
increasing numbers of students feeling at risk. The dropout rate between Grades 8 to 12 ranged 
from 30% to 60 % among schools in British Columbia. The follow-up study to that literature 
review pointed to the teaching that takes place in the secondary school as the major problem 
for those students who dropped out and also for those who remained. Lecturing, seatwork and 
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recitation dominated the textbook driven instruction in those 16 secondary schools. The 
question that master planners will have to face soon is just how reform can be brought about in 
the high schools. Within this context we then place the practice of final Grade 12 
examinations. Both our data and our interpretations of that data suggest that final 
examinations are having the effect of effectively freezing current practices at the Grade 12 
level. The opportunity or motivation for teachers to attempt changes to their practice, a 
necessary condition some would posit for reform, is effectively nullified by the practice of final 
examinations. 

We make this claim based on two observations, the first coming from what people told 
us about the impact of final exams, and the second from what we observed in the way of 
teaching. As we noted earlier, several teachers at the Grades 8 and 10 level reported that the 
Grade 12 examination had little or no effect on their teaching. Grade 12 teachers consistently 
told us the opposite, that such examinations were affecting what they did in the classroom. 
From the data collected during classroom observations, we would be inclined to agree with 
these teachers. When we observed their practice, almost without exception, any indication of 
exemplary science teaching that we witnessed took place in the lower grades. For example, in 
one Grade 8 class, the making of peanut butter was used to demonstrate chemical change; in a 
class in genetics, Grade 10 students explored changes in a developing fetus by writing essays in 
which the students viewed the process through the senses of the fetus. On the other hand, the 
majority of straight lecturing and non-student involvement in class took place in Grade 12 
classrooms. An emphasis on tests and testing appeared to dominate many of these classes and 
the most prevalent student question we heard there was, "will this be on the exam?" It seemed 
to be a common practice that the last three or four weeks of Grade 12 classes would be devoted to 
going over old exams and attempting to review and predict what the form and substance might 
be for the coming one. 

If our contention is valid that Grade 12 examinations have effectively frozen 
innovative practice in Grade 12, then it can be argued that that fact will in the long run augur 
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against improved practice in the high school. This contention rests on the belief that such 
improvement can only come from teachers who feel some freedor to experiment with their 
practice. Here a normative/reeducative approach to change as described by Chinn and Benne 
(Bennis, et al., 1976) would ensue. In the present system where the main preoccupation appears 
to be content coverage and a focus upon the evaluation instruments themselves, it is very 
difficult to imagine anyone venturing into any type of innovative practice. For some this 
situation might not suggest a problem. But if you take the view that the high school presently 
is an institution badly in need of reform (see for example, Goodlad, 1984; Sewall, 1983; Van Til, 
1976) and further speculate that teachers require some freedom to experiment and take risks to 
bring about this reform, then the existence of large-scale testing as currently practiced 
effectively reduces the chances that such change will occur. 
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Table 1 

Science Teachers' Instructional Format 
(mean percentage of class time) 
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Figure 1. Box-and-whiskers plots showing science teachers' instructional 
format 
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Figure 2: Summary of Phase II Teacher Profiles. 
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